Detecting ongoing events using contextual word and sentence embeddings

نویسندگان

چکیده

This paper introduces the Ongoing Event Detection (OED) task, which is a specific task where goal to detect ongoing event mentions only, as opposed historical, future, hypothetical, or other forms events that are neither fresh nor current. Any application needs extract structured information about from unstructured texts can take advantage of an OED system. The main contribution this following: (1) it along with dataset manually labeled for task; (2) presents design and implementation RNN model uses BERT embeddings define contextual word sentence attributes, best our knowledge were never used before detecting in news; (3) extensive empirical evaluation includes (i) exploration different architectures hyperparameters, (ii) ablation test study impact each attribute, (iii) comparison replication state-of-the-art model. results offer several insights into importance indicate proposed approach effective outperforming baseline models.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Word Embeddings Model for Sentence Similarity

Currently, word embeddings (Bengio et al, 2003; Mikolov et al, 2013) have had a major boom due to its performance in di erent Natural Language Processing tasks. This technique has overpassed many conventional methods in the literature. From the obtained embedding vectors, we can make a good grouping of words and surface elements. It is common to represent top-level elements such as sentences, u...

متن کامل

Detecting Most Frequent Sense using Word Embeddings and BabelNet

Since the inception of the SENSEVAL evaluation exercises there has been a great deal of recent research into Word Sense Disambiguation (WSD). Over the years, various supervised, unsupervised and knowledge based WSD systems have been proposed. Beating the first sense heuristics is a challenging task for these systems. In this paper, we present our work on Most Frequent Sense (MFS) detection usin...

متن کامل

Detecting Cross-Lingual Plagiarism Using Simulated Word Embeddings

Cross-lingual plagiarism (CLP) occurs when texts written in one language are translated into a different language and used without acknowledging the original sources. One of the most common methods for detecting CLP requires online machine translators (such as Google or Microsoft translate) which are not always available, and given that plagiarism detection typically involves large document com...

متن کامل

Word Embeddings and Their Use In Sentence Classification Tasks

This paper has two parts. In the first part we discuss word embeddings. We discuss the need for them, some of the methods to create them, and some of their interesting properties. We also compare them to image embeddings and see how word embedding and image embedding can be combined to perform different tasks. In the second part we implement a convolutional neural network trained on top of pre-...

متن کامل

Word embeddings and Global Preference for Contextual Suggestion

In this paper we describe our effort on 2016 Contextual Suggestion Track. We present a new ranking model that captures both global trend of interests as well as contextual individual preference. We trained a regressor on common users data thus it can prioritize popular places and categories. In order to model individual user preference, we introduce word embeddings to represent both user profil...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Expert Systems With Applications

سال: 2022

ISSN: ['1873-6793', '0957-4174']

DOI: https://doi.org/10.1016/j.eswa.2022.118257